Query Interface Integrator For Domain Specific Hidden Web
نویسندگان
چکیده
Web is title admittance today mainly relies on search engines. A large amount of data is hidden in the databases behind the search interfaces referred to as “Hidden web”, which needs to be indexed so in order to serve user’s query. In this paper database and data mining techniques are used for query interface integration (QII). The query interface must resemble the look and feel of local interface as much as possible despite being automatically generated without human support.This technique keeps the related documents in the same domain so that searching of documents becomes more efficient in terms of time complexity.
منابع مشابه
Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملMulti-objective optimization integration of query interfaces for the Deep Web based on attribute constraints
Article history: Received 1 September 2011 Received in revised form 25 December 2012 Accepted 7 January 2013 Available online 16 January 2013 In order to query and retrieve the rich and useful information hidden in the DeepWeb efficiently, extensive research on domain-specific Deep Web Data Integration Systems (DWDIS) has been carried out in recent years. In DWDIS, large-scale automatic integra...
متن کاملA Novel Approach to Integrated Search Information Retrieval Technique for Hidden Web for Domain Specific Crawling
The traditional web crawlers retrieve contents from only the “Surface web” and are unable to crawl through the hidden portion of the Web containing high quality information which is dynamically generated through querying databases when the queries are submitted through a search interface. For Hidden web, most of the published research has been done to identify/detect such searchable forms and m...
متن کاملAn Improved Extraction Algorithm from Domain Specific Hidden Web
The web contains a large amount of information which is increasing by magnitude every day. The World Wide Web consists of Surface Web (Publicly Indexed Web) and the Deep Web which consists of Hidden Data, alsoreferred to by different names such as Hidden Web, Deepnet or the Invisible Web. A user can directly access the surface web through a Search Engine but to access the hidden data/informatio...
متن کاملA Novel Approach for Automatic Detection and Unification of Web Search Query Interfaces Using Domain Ontology
A large amount of information on the Web, stored behind search interfaces cannot be indexed by general-purpose search engines as it is dynamically generated through querying databases. Such databases called Hidden Web or Deep Web contains high quality information. Deep web contents are generated only when queries are asked via a search interface, rendering interface integration a critical probl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1311.4900 شماره
صفحات -
تاریخ انتشار 2013